[WIP] Demonstration provider #4988

chriselion · 2021-02-22T23:51:57Z

Proposed change(s)

Very rough WIP to convert GAIL and BC to use a new DemonstrationProvider interface.

The eventual goal (beyond the scope of this PR) is to let users define their own DemonstrationProvider interface in a plugin and use that instead of the given LocalDemonstrationProvider.

In this PR:

Add DemonstrationProvider interface
Break up the previous demo_loader() code into demonstration_proto_utils and LocalDemonstrationProvider impl
Convert GAIL and BC to use DemonstrationProvider.

TODO

Fix tets
Decide what fields should actually be on DemonstrationExample/DemonstratoinTrajectory
Remove old demo_loader code

Useful links (Github issues, JIRA tickets, ML-Agents forum threads etc.)

https://jira.unity3d.com/browse/MLA-1734

Types of change(s)

New feature
Code refactor

Checklist

Added tests that prove my fix is effective or that my feature works
Updated the changelog (if applicable)

Other comments

chriselion · 2021-02-22T23:52:56Z

ml-agents/mlagents/trainers/demonstrations/demonstration_proto_utils.py

+
+
+@timed
+def load_demonstration(


was demo_loader.load_demonstration

chriselion · 2021-02-22T23:58:57Z

ml-agents/mlagents/trainers/demonstrations/local_demonstration_provider.py

+        return trajectories
+
+    @staticmethod
+    def _get_demo_files(path: str) -> List[str]:


from demo_loader.get_demo_files

chriselion · 2021-02-23T00:05:53Z

ml-agents/mlagents/trainers/demonstrations/local_demonstration_provider.py

+    def get_behavior_spec(self) -> BehaviorSpec:
+        return self._behavior_spec
+
+    def pop_trajectories(self) -> List[DemonstrationTrajectory]:


Need to add docstrings here. But the idea is that GAIL, etc could be converted to use pop_trajectories() directly. Then if we want DemonstrationProviders to be able to load new demonstrations on the fly, the logic can be kept in the DemonstrationProvider and the consumer doesn't need to know about it, it just gets a fresh batch of trajectories.

chriselion · 2021-02-23T00:06:33Z

ml-agents/mlagents/trainers/demonstrations/demonstration_provider.py

+from mlagents.trainers.trajectory import ObsUtil
+
+
+class DemonstrationExperience(NamedTuple):


These are trimmed down versions of AgentExperience and Trajectory classes, based on what's currently in demo_loader.

Hmm, I feel like we shouldn't duplicate the conversion code here between AgentExperience and Trajectory (esp. with the teammate observations coming in, it becomes quite a fat function - and at some point I imagine we'll have teammate demonstrations as well).

Wonder if we can have a BaseAgentExperience be the base class that is used here and in the AgentProcessor, and have the AgentExperience (PolicyAgentExperience?) inherit from it? Or some other way of composing these two.

chriselion · 2021-03-03T19:39:31Z

Shelving this for now, since research won't be able to make use of it for a while, and we're still not sure on the trajectory/experience handling.

Chris Elion added 7 commits February 1, 2021 17:35

WIP

eb4821e

Merge remote-tracking branch 'origin/master' into MLA-1734-demo-provider

6181acb

WIP

c4d0852

Merge remote-tracking branch 'origin/master' into MLA-1734-demo-provider

b934340

Merge remote-tracking branch 'origin/master' into MLA-1734-demo-provider

d95b126

demo-specific exp and traj

46f99e6

cleanup, don't store mask

92377b5

chriselion commented Feb 22, 2021

View reviewed changes

chriselion commented Feb 23, 2021

View reviewed changes

Base automatically changed from master to main February 25, 2021 19:16

chriselion closed this Mar 3, 2021

github-actions bot locked as resolved and limited conversation to collaborators Mar 3, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[WIP] Demonstration provider #4988

[WIP] Demonstration provider #4988

Uh oh!

chriselion commented Feb 22, 2021 •

edited

Loading

Uh oh!

chriselion Feb 22, 2021

Uh oh!

chriselion Feb 22, 2021

Uh oh!

chriselion Feb 23, 2021

Uh oh!

chriselion Feb 23, 2021

Uh oh!

ervteng Feb 23, 2021

Uh oh!

chriselion commented Mar 3, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		from mlagents.trainers.trajectory import ObsUtil


		class DemonstrationExperience(NamedTuple):

[WIP] Demonstration provider #4988

[WIP] Demonstration provider #4988

Uh oh!

Conversation

chriselion commented Feb 22, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Proposed change(s)

Useful links (Github issues, JIRA tickets, ML-Agents forum threads etc.)

Types of change(s)

Checklist

Other comments

Uh oh!

chriselion Feb 22, 2021

Choose a reason for hiding this comment

Uh oh!

chriselion Feb 22, 2021

Choose a reason for hiding this comment

Uh oh!

chriselion Feb 23, 2021

Choose a reason for hiding this comment

Uh oh!

chriselion Feb 23, 2021

Choose a reason for hiding this comment

Uh oh!

ervteng Feb 23, 2021

Choose a reason for hiding this comment

Uh oh!

chriselion commented Mar 3, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

chriselion commented Feb 22, 2021 •

edited

Loading